Online Robot Task Switching Under Diminishing Returns
Authors
Abstract
We investigate the task-switching problem of a robot maximizing its long-term average rate of return on work performed. We propose an online method that maximizes the average gain rate based only on past experience. To do so, we alter the formulation from optimal foraging theory and recursively include estimates of global task qualities. We demonstrate and analyze our method on a puck-foraging example. In simulation experiments under a variety of conditions, we show that our method performs well compared to results obtained by a brute-force method using post-processed foraging data.
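As a rough illustration of the kind of rule the abstract alludes to, the sketch below applies the classic leave rule from optimal foraging theory (the marginal value theorem): keep working on a task until the most recently observed gain rate falls below the long-term average rate seen so far. This is not the authors' modified formulation. The exponential gain curve, the parameter values, and the names `patch_gain`, `online_switching`, and `brute_force_rate` are all assumptions made for illustration only.

```python
import math


def patch_gain(t, amplitude=10.0, tau=5.0):
    """Cumulative gain after t time units on a fresh task (assumed diminishing-returns curve)."""
    return amplitude * (1.0 - math.exp(-t / tau))


def online_switching(n_patches=200, switch_cost=4.0, dt=0.1):
    """Simulate the leave rule using only gains and times observed so far."""
    total_gain, total_time = 0.0, 0.0
    residence_times = []
    for _ in range(n_patches):
        total_time += switch_cost  # time spent switching to the next task
        steps = 0
        while True:
            # Gain collected during the next work step of length dt.
            step_gain = patch_gain((steps + 1) * dt) - patch_gain(steps * dt)
            steps += 1
            total_gain += step_gain
            total_time += dt
            # Leave once the last observed rate drops below the running average rate.
            if step_gain / dt < total_gain / total_time:
                break
        residence_times.append(steps * dt)
    return total_gain / total_time, residence_times


def brute_force_rate(switch_cost=4.0, dt=0.1, max_steps=300):
    """Best average rate over all fixed residence times on the same time grid."""
    return max(
        patch_gain(k * dt) / (k * dt + switch_cost)
        for k in range(1, max_steps + 1)
    )


if __name__ == "__main__":
    rate, times = online_switching()
    print(f"online average rate:      {rate:.4f}")
    print(f"brute-force optimal rate: {brute_force_rate():.4f}")
    print(f"last residence time:      {times[-1]:.1f}")
```

Under these assumptions every task is identical and noise-free, so the running average settles quickly; the brute-force search over fixed residence times serves only as a reference point, loosely mirroring the comparison described in the abstract.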
Similar resources
Non-Linear Relationships Among Oil Price, Gold Price and Stock Market Returns in Iran: A Multivariate Regime-Switching Approach
In this paper, the effects of oil and gold prices on the stock market index are investigated. We use a cointegrated vector autoregressive Markov-switching model to examine the nonlinear properties of these three variables from January 2003 to December 2014. The Markov-switching vector-equilibrium-correction model with three regimes representing "deep recession", "mild recession" and ...
A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking
A simple, easy-to-implement algorithm is proposed to address the wall-tracking task of an autonomous robot. The robot should navigate in unknown environments, find the nearest wall, and track it based solely on locally sensed data. The proposed method benefits from coupling fuzzy logic and Q-learning to meet the requirements of autonomous navigation. Fuzzy if-then rules provide a reliable decision maki...
Online Interaction in Higher Education: Is There Evidence of Diminishing Returns?
Online interaction is considered to be a key aspect of effective e-learning and improved academic achievement. However, few studies have examined how effectiveness varies with the degree of interaction intensity. Using data for 17,090 students from three Catalan universities, in this paper we study the productivity associated with five different levels of interaction intensity in learning. We a...
Contingency Enhances Sensitivity to Loss in a Gambling Task with Diminishing Returns.
This study examined whether gambling behavior under conditions of diminishing returns differed between participants with histories of contingent (CD group) and noncontingent (NCD group) token delivery. In Phase 1, CD participants accrued tokens by correctly completing a discrimination task; for NCD participants, token accrual was yoked to token delivery of CD participants. In Phase 2, participa...
Workspace Boundary Avoidance in Robot Teaching by Demonstration Using Fuzzy Impedance Control
The present paper investigates an intuitive way of robot path planning, called robot teaching by demonstration. In this method, an operator holds the robot end-effector and moves it through a number of positions and orientations in order to teach it a desired task. The presented control architecture applies impedance control in such a way that the end-effector follows the operator’s hand with d...
Journal title:
Volume, issue:
Pages: -
Publication date: 2010